We propose the Bi-Event Subtraction Technique (BEST) as a method of modeling and subtracting
large portions of the combinatoric background during reconstruction of particle decay chains at 
hadron colliders. The combinatoric background arises when it is impossible to know experimentally 
which observed particles come from the decay chain of interest. The background shape can be 
modeled by combining observed particles from different collision events and be subtracted away, 
greatly reducing the overall background. This idea has been demonstrated in various experiments 
in the past. We generalize it by showing how to apply BEST multiple times in a row to fully 
reconstruct a cascade decay. We show the power of BEST with two simulated examples of its 
application towards reconstruction of the top quark and a supersymmetric decay chain at the Large 
Hadron Collider. 



The Large Hadron Collider (LHC) has been up and running since 2009. Many models of particle physics beyond the
Standard Model (SM) predict new particles which can be tested at the LHC. Heavy colored objects are expected 
to be produced at the LHC, followed by a chain of subsequent decays, according to such new models. Thus, 
we must fully or partially reconstruct these cascade decays from the particles which can be detected. However, 
reconstructions of these decays become experimentally difficult because it is impossible to know which particles 
come from the cascade decay we wish to reconstruct. The inevitable inclusion of particles which do not come 
from the cascade decay of interest is referred to as combinatoric background. 

This combinatoric background can be removed easily in some cases by powerful subtraction techniques. For 
instance, the Z boson can decay into oppositely charged, same-flavor leptons: $Z \to e^+e^- / \mu^+\mu^-$. Leptons
are easy to detect in the collider setting, and their charges can easily be measured. To reconstruct the Z 
boson from these leptons, it is easy to collect a sample of Opposite-Sign Same-Flavor (OSSF) lepton pairs 
and construct the dilepton invariant mass for each pair. To model the combinatoric background, a sample 
of Opposite-Sign Opposite-Flavor (OSOF) lepton pairs is selected as well. These OSOF lepton pairs cannot 
possibly both come from a single Z boson, and so they model the combinatoric background well. Performing 
the OSSF$-$OSOF subtraction of the invariant mass distributions (possibly using some normalization factor $c$),
$h^{\rm OSSF-OSOF}(m_{\ell\ell}) = h^{\rm OSSF}(m_{\ell\ell}) - c\,h^{\rm OSOF}(m_{\ell\ell})$, yields a distribution which shows a clear peak at the Z boson
mass.
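
The OSSF$-$OSOF subtraction can be illustrated with a minimal sketch. The function name `ossf_minus_osof`, the binning, and the toy samples below are assumptions made for illustration only; they are not taken from any analysis described here. The sketch histograms the two dilepton mass samples on a common binning and subtracts them bin by bin.

```python
import numpy as np

def ossf_minus_osof(m_ossf, m_osof, edges, c=1.0):
    """Return h_OSSF - c * h_OSOF on a common binning (masses in GeV)."""
    h_ossf, _ = np.histogram(m_ossf, bins=edges)
    h_osof, _ = np.histogram(m_osof, bins=edges)
    return h_ossf - c * h_osof

# Toy inputs (invented): a flat combinatoric background in both samples,
# plus a Z-like peak that is present in the OSSF sample only.
rng = np.random.default_rng(0)
background_ossf = rng.uniform(40.0, 140.0, size=5000)
background_osof = rng.uniform(40.0, 140.0, size=5000)
z_peak = rng.normal(91.2, 3.0, size=1000)

edges = np.linspace(40.0, 140.0, 51)  # 2 GeV bins
h_sub = ossf_minus_osof(np.concatenate([background_ossf, z_peak]), background_osof, edges)

# Locate the bin with the largest excess after subtraction.
peak_bin = int(np.argmax(h_sub))
peak_center = 0.5 * (edges[peak_bin] + edges[peak_bin + 1])
```

In this toy the two background samples have equal size, so $c = 1$ is adequate; with real data the factor would have to be determined from the samples themselves.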

However, such subtraction techniques are not available for jets, whose charges and flavors cannot so easily 
be determined. Thus, we introduce the Bi-Event Subtraction Technique (BEST) in which the combinatoric 
background of jets is modeled by combining jets from two different events (a bi-event combination). This technique of
modeling the combinatoric background by combining information from different events has been used before [1].
Here we generalize it by applying it to jets, and we show that it can be applied multiple
times within the same decay chain reconstruction.

The basic idea of BEST can be demonstrated for the reconstruction of the W boson decaying into two jets.
For this case, a signal may be seen if a sample of jet pairs is collected for each event to construct the dijet
invariant mass distribution, $h^{\rm same}(m_{jj})$. Here, "same" indicates that both jets of a pair come from the same event.
Some of the jet pairs in the same-event distribution may come from a single W boson decay, while
other jet pairs will be combinatoric background. By collecting another sample of jet pairs where each jet comes
from a different event, the bi-event distribution, $h^{\rm bi}(m_{jj})$, can be formed. This bi-event distribution contains
no jet pairs which come from a single W boson. Thus, it models a large part of the
combinatoric background well. The $h^{\rm bi}(m_{jj})$ distribution can be normalized to the $h^{\rm same}(m_{jj})$ distribution in
the region of pure background (well away from the W boson mass peak). For instance, the normalization factor
can be calculated as
This normalization factor can be used when the shapes of these distributions are very close in this region. If the
shapes of these distributions are not close, it could be due to some new physics. For instance, an additional
resonance in the $h^{\rm same}(m_{jj})$ distribution could cause a mismatch in the shapes. However, it would be easy
enough to recalculate the normalization over a range which excludes the additional resonance. It
should be noted that one needs a detailed systematic study of the shape from different physics processes; this
is beyond the scope of this paper.
Finally, the BEST is performed:

$h^{\rm BEST}(m_{jj}) = h^{\rm same}(m_{jj}) - C^{\rm BEST}\,h^{\rm bi}(m_{jj})$. (2)

The resulting dijet distribution shows a W boson mass peak with most of the combinatoric background removed. 
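
The normalization and subtraction steps can be sketched as follows. This is only an illustration under stated assumptions: the normalization factor is taken to be the ratio of the integrals of the same-event and bi-event histograms over a background-only window, and the function name `best_subtract`, the window, the binning, and the toy data are all hypothetical.

```python
import numpy as np

def best_subtract(m_same, m_bi, edges, norm_window):
    """Return (h_same - C * h_bi, C) for dijet masses in GeV.

    C is assumed to be the ratio of the two histogram integrals inside
    norm_window, a background-only mass range chosen by the analyst.
    """
    h_same, _ = np.histogram(m_same, bins=edges)
    h_bi, _ = np.histogram(m_bi, bins=edges)
    centers = 0.5 * (edges[:-1] + edges[1:])
    in_window = (centers >= norm_window[0]) & (centers < norm_window[1])
    c_best = h_same[in_window].sum() / h_bi[in_window].sum()
    return h_same - c_best * h_bi, c_best

# Toy data (invented): a smooth background in both samples and a W-like
# peak that appears in the same-event sample only.
rng = np.random.default_rng(1)
m_same = np.concatenate([rng.exponential(80.0, 20000), rng.normal(80.4, 8.0, 2000)])
m_bi = rng.exponential(80.0, 60000)
edges = np.linspace(0.0, 400.0, 81)  # 5 GeV bins
h_best, c_best = best_subtract(m_same, m_bi, edges, norm_window=(150.0, 400.0))
```

Because the bi-event sample can be made much larger than the same-event sample, the factor is typically well below one, which also reduces the statistical noise added by the subtraction.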

If we wish to reconstruct decay chains involving these W bosons, we can take BEST even further. For 
instance, we can completely reconstruct the top quark from the decay chain $t \to bW \to bjj$. We can apply
BEST again while combining the b jets with the reconstructed W bosons in order to reconstruct the top quark. 
However, this requires a more general application of BEST than has been used before. 

For this example, we will refer to the same-event histograms by denoting the jets in the subscript as $j$ and
$b$ for jets and b-jets respectively. For the bi-event histograms, we denote the jets in the subscript as $j'$ and $b'$.
Thus we now denote our histograms and normalization factor from Eqs. (1) and (2) as:

$h^{\rm same}(m_{jj}) = h_{jj}(m_{jj})$, (3a)

$h^{\rm bi}(m_{jj}) = h_{jj'}(m_{jj})$, (3b)

$C^{\rm BEST} = C^{\rm BEST\#1}$. (3c)
To combine the reconstructed W bosons with the b-jets to reconstruct the top quarks, we will need the
following four additional histograms in order to perform two applications of BEST: $h_{bjj}(m_{bjj})$, $h_{bjj'}(m_{bjj})$,
$h_{b'jj}(m_{bjj})$, and $h_{b'jj'}(m_{bjj})$. We perform the first BEST using the normalization factor calculated above in
Eq. (1):

$h_{bjj}^{\rm BEST\#1}(m_{bjj}) = h_{bjj}(m_{bjj}) - C^{\rm BEST\#1}\,h_{bjj'}(m_{bjj})$, (4a)
$h_{b'jj}^{\rm BEST\#1}(m_{bjj}) = h_{b'jj}(m_{bjj}) - C^{\rm BEST\#1}\,h_{b'jj'}(m_{bjj})$. (4b)

Next we calculate another normalization factor for the second BEST which involves the combinatoric background 
of the b-jets. Once again, the range of this normalization factor is aimed at the region of pure background away
from the top quark mass peak. Thus, it is calculated as: 
With this normalization factor, we can finally perform the second BEST:

$h_{bjj}^{\rm BEST\#2}(m_{bjj}) = h_{bjj}^{\rm BEST\#1}(m_{bjj}) - C^{\rm BEST\#2}\,h_{b'jj}^{\rm BEST\#1}(m_{bjj})$. (6)

Here, the resulting histogram will show a clean top quark mass peak with most of the combinatoric background 
removed. To clean up the resulting distribution even more, other subtraction techniques can also be employed, 
such as a sideband subtraction for the W boson reconstruction. Each additional subtraction will double the 
number of initial histograms which are needed for all of the subtractions. 
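
The bookkeeping of the two successive subtractions can be sketched as below. The helper names `norm_factor` and `double_best` are hypothetical, the second normalization factor is assumed to be a ratio of histogram integrals over a background-only window, and the bin contents are toy numbers constructed so that the signal is recovered exactly.

```python
import numpy as np

def norm_factor(h_num, h_den, centers, lo, hi):
    """Ratio of histogram integrals in the window [lo, hi) (assumed form)."""
    window = (centers >= lo) & (centers < hi)
    return h_num[window].sum() / h_den[window].sum()

def double_best(h_bjj, h_bjjp, h_bpjj, h_bpjjp, c1, centers, bg_window):
    """Apply BEST twice: first for the jet pairs, then for the b-jets."""
    # First BEST: remove the dijet combinatorics from both b-jet samples.
    h1_b = h_bjj - c1 * h_bjjp       # same-event b
    h1_bp = h_bpjj - c1 * h_bpjjp    # b taken from a different event
    # Second normalization, fixed away from the top mass peak.
    c2 = norm_factor(h1_b, h1_bp, centers, *bg_window)
    # Second BEST: remove the b-jet combinatorics.
    return h1_b - c2 * h1_bp, c2

# Toy bin contents (invented): a signal in one bin plus flat backgrounds.
centers = np.array([125.0, 175.0, 225.0, 275.0, 325.0])
signal = np.array([0.0, 10.0, 0.0, 0.0, 0.0])
h_bjj = signal + 4.0 + 2.0        # signal + dijet bkg + b-jet bkg
h_bjjp = np.full(5, 8.0)          # models the dijet bkg when scaled by c1
h_bpjj = np.full(5, 10.0)
h_bpjjp = np.full(5, 12.0)
h_top, c2 = double_best(h_bjj, h_bjjp, h_bpjj, h_bpjjp,
                        c1=0.5, centers=centers, bg_window=(200.0, 350.0))
```

Four input histograms are needed for two subtractions, consistent with the doubling noted above for each additional subtraction.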

We demonstrate this powerful technique by using it to extract $W \to jj$ for (i) $t\bar{t}$ events at $\sqrt{s} = 7$ TeV and
(ii) SUSY events at $\sqrt{s} = 14$ TeV within LHC simulations.

For the $t\bar{t}$ events, we generate hard-scattering LHC collision events using ALPGEN [2], perform the cascade
decays with PYTHIA [3], and perform an LHC detector simulation using PGS4 [4]. The W+jets events are the main
source of background for finding the top quark, so we generate these events in the same way. This background
is mixed in randomly, according to production cross-sections, with our $t\bar{t}$ events. After PGS4 is finished with
these events, we select events for analysis with the following cuts [5]: (i) Number of leptons, $N_\ell = 1$, where
$p_T^{(\ell)} > 20$ GeV and $p_T^{\rm iso} < 0.1 \times p_T^{(\ell)}$; (ii) Missing transverse energy, $\not\!\!E_T > 20$ GeV; (iii) Number of jets, $N_j > 3$,
where $p_T^{(j)} > 30$ GeV and at least one jet has been tightly b-tagged [J]; (iv) Number of taus, $N_\tau = 0$, for taus
with $p_T^{(\tau)} > 20$ GeV 0].

With our events selected in this way, we pair up jets (which are not b-tagged) to fill the same-event and
bi-event $h(m_{jj})$ distributions as described above. Each jet pair must have $\Delta R > 0.4$. To fill the bi-event
distribution, we pair jets with those from the previous event which has passed the same cuts as listed above. Once
the distributions are filled with all events, we normalize the $h^{\rm bi}(m_{jj})$ distribution as described by
Eq. (1). Then we perform our BEST. The result of this subtraction can be seen in Fig. 1, which shows a drastic
reduction in the background obscuring the W boson reconstruction. Note that the bi-event distribution models
the combinatoric background of any jet pairs which are not correlated by decay chains or event kinematics.
Thus, BEST in this case removes (i) the combinatoric background from events with W bosons (coming from t
decays) and (ii) uncorrelated jet pairs coming from our W+jets background sample.
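
The filling of the same-event and bi-event samples can be sketched as follows. This is a simplified illustration, not the analysis code: jets are assumed to be massless and given as hypothetical `(pT, eta, phi)` tuples, the function names are invented, and each event is paired only with the previously selected event.

```python
import math

def delta_r(j1, j2):
    """Angular separation of two jets given as (pT, eta, phi)."""
    deta = j1[1] - j2[1]
    dphi = math.remainder(j1[2] - j2[2], 2.0 * math.pi)  # wrap into [-pi, pi]
    return math.hypot(deta, dphi)

def dijet_mass(j1, j2):
    """Invariant mass of two massless jets given as (pT, eta, phi)."""
    m2 = 2.0 * j1[0] * j2[0] * (math.cosh(j1[1] - j2[1]) - math.cos(j1[2] - j2[2]))
    return math.sqrt(max(m2, 0.0))

def fill_pairs(selected_events, min_dr=0.4):
    """Return (same-event masses, bi-event masses) for non-b-tagged jets.

    selected_events is a list of jet lists, one per event passing the cuts.
    """
    same, bi = [], []
    previous = None
    for jets in selected_events:
        for i, a in enumerate(jets):
            # Same-event pairs: both jets from the current event.
            for b in jets[i + 1:]:
                if delta_r(a, b) > min_dr:
                    same.append(dijet_mass(a, b))
            # Bi-event pairs: one jet from the previously selected event.
            if previous is not None:
                for b in previous:
                    if delta_r(a, b) > min_dr:
                        bi.append(dijet_mass(a, b))
        previous = jets
    return same, bi

# Two identical toy events, each with two back-to-back 40 GeV jets.
toy_events = [[(40.0, 0.0, 0.0), (40.0, 0.0, math.pi)]] * 2
same_masses, bi_masses = fill_pairs(toy_events)
```

Applying the same $\Delta R$ requirement to the bi-event pairs keeps the two samples kinematically comparable, which is what allows one to be normalized to the other.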
FIG. 1: The dijet invariant mass distribution, $m_{jj}$. This plot shows the same-event ($m_{jj}^{\rm same}$), bi-event ($m_{jj}^{\rm bi}$), and BEST
($m_{jj}^{\rm BEST}$) distributions as described in the text. The BEST distribution is fitted with a Gaussian plus cubic function to
find the W boson mass peak and surrounding background. The BEST distribution is also split up into regions for a
sideband subtraction used for reconstructing an invariant mass between a W boson and a b-tagged jet. The W region
is dark cyan filled, while the sidebands are orange filled. For an integrated luminosity of 2 fb$^{-1}$, we find the W boson
mass, $m_W = 81.11 \pm 0.32$ GeV.

FIG. 2: The W plus b invariant mass distribution, $m_{bW}$. This plot shows the same-event, bi-event, and BEST distributions
as described in the text. For an integrated luminosity of 2 fb$^{-1}$, we find the top quark mass, $m_t = 170.5 \pm 1.5$ GeV.
The top quark mass is set within ALPGEN as $m_t = 174.3$ GeV.



Once we have found the W boson with this first application of BEST, we can combine the W boson with a b-jet
to find the top quark. To remove additional background from the W signal, we perform a sideband subtraction.
To do this, we split up the dijet signal into a W boson mass region, where 70 GeV < $m_{jj}$ < 90 GeV, and
two sideband regions, 40 GeV < $m_{jj}$ < 55 GeV and 105 GeV < $m_{jj}$ < 120 GeV. We form the dijet (W)
plus b invariant mass, keeping track of whether the dijet system was in the W window or sideband windows.
In this way we make the W band ($h^{W{\rm band,\,BEST}}(m_{bW})$) and sideband ($h^{\rm SB,\,BEST}(m_{bW})$) distributions. The
sideband distribution models the remaining background of W's very well. By fitting $h^{\rm BEST}(m_{jj})$ with a
Gaussian function, $f(m_{jj}^{\rm BEST})$, plus a background function, $g^{\rm BG}(m_{jj}^{\rm BEST})$, we can find the shape of the background
distribution which remains. Then we calculate a normalization factor:
Using this normalization factor, we perform the sideband subtraction,

$h^{\rm SBsub,\,BEST}(m_{bW}) = h^{W{\rm band,\,BEST}}(m_{bW}) - C^{\rm SB}\,h^{\rm SB,\,BEST}(m_{bW})$. (8)




FIG. 3: The dijet invariant mass distribution, $m_{jj}$, from our nuSUGRA events mixed with SM backgrounds. The BEST
has already been performed. The BEST distribution is fitted and split up into regions for a sideband subtraction used
for reconstructing an invariant mass between a W boson and a leading jet. The W region is dark cyan filled, while the
sidebands are orange filled. Here we find the W boson mass, $m_W = 82.4 \pm 1.0$ GeV. This plot is for an integrated
luminosity of 100 fb$^{-1}$.



This subtraction removes even more of the W combinatoric background. 
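
A sideband subtraction of this kind can be sketched as follows. The form of the normalization factor is an assumption made for this illustration: it is taken to be the integral of the fitted background function over the W window divided by its integral over the two sidebands. The function names and the toy background shape are hypothetical; the window boundaries are those quoted in the text.

```python
import numpy as np

W_WINDOW = (70.0, 90.0)                       # GeV
SIDEBANDS = [(40.0, 55.0), (105.0, 120.0)]    # GeV

def integrate(g, lo, hi, n=1001):
    """Trapezoidal integral of the callable g over [lo, hi]."""
    x = np.linspace(lo, hi, n)
    y = g(x)
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

def sideband_norm(g_bg):
    """Assumed form: background under the W window over background in the sidebands."""
    in_window = integrate(g_bg, *W_WINDOW)
    in_sidebands = sum(integrate(g_bg, lo, hi) for lo, hi in SIDEBANDS)
    return in_window / in_sidebands

def sideband_subtract(h_wband, h_sb, c_sb):
    """h_SBsub = h_Wband - C_SB * h_SB, bin by bin in m_bW."""
    return np.asarray(h_wband, dtype=float) - c_sb * np.asarray(h_sb, dtype=float)

# Toy background shape (invented): linearly falling in m_jj.
def g_toy(m):
    return 100.0 - 0.5 * m

c_sb = sideband_norm(g_toy)
h_sub = sideband_subtract([10.0, 30.0, 10.0], [6.0, 6.0, 6.0], c_sb)
```

Using the fitted background function rather than raw sideband counts lets the factor account for a background that is not flat between the sidebands and the W window.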

Lastly, to remove the combinatoric background of b-jets, we can perform our BEST again. We form the
$h^{\rm SBsub,\,BEST}(m_{bW})$ distribution again, this time using b-jets which come from a different event than the W.
Again, this models the combinatoric background very well, since a W and a b from different events cannot
possibly come from a single top quark. We can calculate a normalization factor as before, analogous to Eq. (5),
in the range 200 GeV < $m_{bW}$ < 500 GeV (a little away from the top mass peak). Using this normalization
factor, we can perform the final BEST, analogously to that shown in Eq. (6). The resulting $m_{bW}$ distribution
after this last application of BEST is shown in Fig. 2, which shows a very clean top peak.

In the context of top reconstruction, other groups have come up with some techniques to eliminate the 
combinatoric background. In experimental top reconstruction [IHZ], combinatoric background is eliminated
by assuming a very particular event topology. By selecting certain events, the combinatoric background is
eliminated by essentially choosing the jet combinations which form the best W and t masses. For SM $t\bar{t}$ events,
this reconstruction works quite well to measure the top mass. However, these methods cannot be employed 
to reconstruct t quarks from beyond SM sources. On the other hand, some phenomenological studies of top 
production from beyond SM use top tagging [5] to identify the top correctly, and thus reduce the combinatoric 
background. However, the top tagging relies on the production of a boosted top from the decay of a heavy new 
particle. Also, although this top tagger has a large efficiency, it seems the fake rate from SM backgrounds may 
be large. These experimental and phenomenological techniques may be more precise than BEST (although a
thorough study would be needed to compare them). However, the advantage of BEST is that it does not require 
any assumptions about the event topology or having boosted tops. 

This example from the SM shows the power of BEST. Additionally, BEST is useful for searches and measurements
of models beyond the SM. Thus, we also demonstrate the use of BEST for a supersymmetry (SUSY)
model. The model we choose is the non-universal generalization of the minimal supergravity model [S], i.e.,
nuSUGRA. In this nuSUGRA model, the Higgs masses are not unified with the other scalar masses at the 
grand unified scale. This allows for a more general mass spectrum than that of the mSUGRA model. The 
indication of the preference for the nuSUGRA model at the LHC is that the neutralino masses will not have 
the mass ratios predicted by mSUGRA. This nuSUGRA model can also predict the correct amount of dark 
matter in the universe today. In particular, a large parameter space region of this model has an abundance of 
W bosons being produced [10]. These W bosons must be found and utilized to reconstruct the model. Thus,
this is a useful model to examine with BEST. 
FIG. 4: The W plus jet invariant mass distribution, $m_{jW}$. This plot shows the same-event, bi-event, and BEST
distributions as described in the text. BEST removes the background obscuring the endpoint. For an integrated
luminosity of 100 fb$^{-1}$, we find the endpoint to be $769 \pm 18$ GeV. This is within $2\sigma$ of the theoretical endpoint, which
is 738.8 GeV for the most probable decay chain of this type, $\tilde{q} \to q + \tilde{\chi}_4^0 \to q + W + \tilde{\chi}_1^{\pm}$.

We choose a benchmark point for the nuSUGRA model for this demonstration: $m_0 = 360$ GeV, $m_{1/2} = 500$ GeV,
$\tan\beta = 40$, $A_0 = 0$, and $m_H = 732$ GeV, with the top mass set as $m_t = 172.6$ GeV. This point in
parameter space predicts an abundance of W bosons at the LHC due to neutralino or chargino decays. The
decay chain we wish to partially reconstruct is: $\tilde{q} \to q + \tilde{\chi}_i^{\pm}\,(\tilde{\chi}_i^{0}) \to q + W + \tilde{\chi}_1^{0}\,(\tilde{\chi}_1^{\mp})$. Our BEST has been
used to analyze this signal already, with the details shown in [10].

To simulate events for this demonstration, we once again use PYTHIA and PGS4. The SUSY mass spectrum 
is generated using ISAJET [11]. We also use ALPGEN to simulate some SM backgrounds. The primary SM
backgrounds for the events we wish to analyze are Z+jets, W+jets, and $t\bar{t}$ events. We mix these SM backgrounds
in randomly with our SUSY signal events. 

To help reduce the SM backgrounds, we use the following selection cuts, which are refined from the cuts in
[10]: (i) Missing transverse energy, $\not\!\!E_T > 180$ GeV; (ii) Number of jets, $N_j > 4$, where $p_T^{(j)} > 30$ GeV; (iii)
Minimum $\Delta\phi$ between leading three jets and missing transverse energy, $\Delta\phi_{\min} > 0.5$; (iv) Leading jet transverse
momenta, $p_T^{(\text{1st } j)} > 300$ GeV and $p_T^{(\text{2nd } j)} > 200$ GeV; (v) $\Delta R$ between leading jets, $\Delta R(\text{1st } j, \text{2nd } j) < 3.2$;
(vi) Scalar sum, $p_T^{(\text{1st } j)} + p_T^{(\text{2nd } j)} + 3 \cdot \not\!\!E_T > 1600$ GeV.

With these event selection cuts, we begin to pair up the sub-leading jets as we did for the $t\bar{t}$ analysis,
perform the BEST to find the W bosons, then combine the W's with the leading jets to reconstruct the
desired decay chain. While pairing up the jets, we use the additional cut $0.4 < \Delta R(jj) < 1.5$. We once
again perform a sideband subtraction to help clean up any excess background involved with finding the W
bosons. When combining the W candidates (jet pairs) with leading jets, we keep only those combinations
where $\Delta R(W, j) > 1.0$. We use BEST again on the leading jet as well, to remove combinatoric background
from the leading jets which are not from our desired decay chain. The result of this analysis can be seen in
Figs. 3 and 4. Note in Fig. 3 that the W boson mass peak can barely be seen in the same-event histogram, but
is clearly visible after the application of BEST.

In conclusion, BEST is a powerful subtraction technique which can find and reconstruct particles normally 
hidden by the combinatoric background, as shown in Fig. 3. It is useful for the further understanding of the
SM as well as models beyond the SM. It can be utilized without information about the charge or flavor of the
particles involved. BEST can therefore benefit current and future collider studies and help us detect new
particles, measure their masses, and determine model parameters accurately.